Candidate Search and Elimination Approach for Telugu OCR
نویسندگان
چکیده
In this paper we propose an OCR system for Telugu based on the candidate search and elimination technique. The initial candidates for recognition are found by applying a zoning method on input glyphs. We propose cavities as a structural approach suited specifically for Telugu script, where cavity vectors are used to prune the candidates found by zoning. A final template matching stage using controlled non linear normalization is performed to conclude the search process. The search can be concluded when at any stage ever an unique candidate is found. A recognition accuracy of 9798% was achieved on real images scanned from Telugu literature.
منابع مشابه
Multi-font Optical Character Recognition System for Printed Telugu Text
The Telugu OCR systems available in the market currently recognize only the specific fonts of Telugu. This paper describes the development of a multi-font OCR system for printed Telugu characters using Artificial Neural Networks. In this system classification of the characters is carried out using multi layer neural network Architecture.
متن کاملOptical Character Recognition (OCR) for Telugu: Database, Algorithm and Application
Telugu is a Dravidian language spoken by more than 80 million people worldwide. The optical character recognition (OCR) of the Telugu script has wide ranging applications including education, health-care, administration etc. The beautiful Telugu script however is very different from Germanic scripts like English and German. This makes the use of transfer learning of Germanic OCR solutions to Te...
متن کاملAn Overview of Optical Character Recognition Systems Research on Telugu Language
This paper gives an overview on the development process and ongoing research of the optical character recognition (OCR) systems for Telugu Text. The aim of this paper is to provide a starting point for the researchers entering into this field. In this paper, we present the introduction, characteristics of the Telugu language, developmental process of the OCR systems of Telugu language, research...
متن کاملSegmentation of Touching Hand written Telugu Characters by using Drop Fall Algorithm
Recognition of Indian language scripts is a challenging problem. Work for the development of complete OCR systems for Indian language scripts is still in infancy. Complete OCR systems have recently been developed for Devanagri and Bangla scripts. Research in the field of recognition of Telugu script faces major problems mainly related to the touching and overlapping of characters. Segmentation ...
متن کامل